Collaborative thesaurus tagging the Wikipedia way

Author

  • Jakob Voß
Abstract

This paper explores the system of categories that is used to classify articles in Wikipedia. It is compared to collaborative tagging systems like del.icio.us and to hierarchical classification like the Dewey Decimal Classification (DDC). Specifics and commonalities of these systems of subject indexing are exposed. Analysis of structural and statistical properties (descriptors per record, records per descriptor, descriptor levels) shows that the category system of Wikimedia is a thesaurus that combines collaborative tagging and hierarchical subject indexing in a special way.
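
The three statistical properties named in the abstract can be illustrated with a minimal sketch. The toy data, the function names, and the choice of shortest distance from a root category as the "level" are assumptions made for illustration only; they are not taken from the paper and do not reproduce its analysis.

```python
from collections import Counter, deque

# Hypothetical toy data: record (article) -> assigned descriptors (categories).
article_categories = {
    "Dewey Decimal Classification": ["Library classification", "Knowledge organization"],
    "Thesaurus": ["Knowledge organization"],
    "Del.icio.us": ["Social bookmarking"],
}

# Hypothetical toy hierarchy: descriptor -> parent descriptors.
category_parents = {
    "Library classification": ["Knowledge organization"],
    "Social bookmarking": ["Knowledge organization"],
    "Knowledge organization": ["Root"],
    "Root": [],
}


def descriptors_per_record(assignments):
    """Number of distinct descriptors assigned to each record."""
    return {record: len(set(cats)) for record, cats in assignments.items()}


def records_per_descriptor(assignments):
    """Number of records that carry each descriptor."""
    counts = Counter()
    for cats in assignments.values():
        counts.update(set(cats))  # count each descriptor once per record
    return dict(counts)


def descriptor_levels(parents, root):
    """Shortest distance of each descriptor from the root (assumed level definition)."""
    # Invert the parent links so the hierarchy can be walked top-down.
    children = {}
    for child, parent_list in parents.items():
        for parent in parent_list:
            children.setdefault(parent, []).append(child)
    levels = {root: 0}
    queue = deque([root])
    while queue:
        node = queue.popleft()
        for child in children.get(node, []):
            if child not in levels:  # first visit in BFS is the shortest path
                levels[child] = levels[node] + 1
                queue.append(child)
    return levels
```

Breadth-first search is used for the levels because a category may have several parents, so a descriptor can be reachable along paths of different lengths; taking the shortest one is only one possible convention.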

Similar resources

Representational Interoperability of Linguistic and Collaborative Knowledge Bases

Creating a Natural Language Processing (NLP) application often requires access to lexical-semantic Knowledge Bases (KBs). Recently, Collaborative Knowledge Bases (CKBs) such as Wikipedia and Wiktionary have been recognized as promising lexical-semantic KBs for NLP (Zesch et al., 2008b), complementing traditional Linguistic Knowledge Bases (LKBs). As CKBs differ significantly from LKBs concernin...

SemKey: A Semantic Collaborative Tagging System

By analysing the current structure and the usage patterns of collaborative tagging systems, we can identify many important aspects that still need to be improved. Problems related to synonymy, polysemy, different lexical forms, misspelling errors or alternate spellings, different levels of precision and different kinds of tag-to-resource association cause inconsistencies and reduce the efficien...

Extreme Tagging: Emergent Semantics through the Tagging of Tags

While the Semantic Web requires a large amount of structured knowledge (triples) to allow machine reasoning, the acquisition of this knowledge still represents an open issue. Indeed, expressing expert knowledge in a given formalism is a tedious process. Less structured annotations such as tagging have, however, proved immensely popular, whilst existing unstructured or semi-structured collaborat...

Analysis of User Behavior on Multilingual Tagging of Learning Resources

Although social, collaborative classification through tagging has been the focus of recent research, the effect of multilingual tags is often overlooked. This work presents an early exploratory study of the production and consumption of multilingual tags in a European educational K-12 context. The data, produced by teachers bookmarking and tagging learning resources during a three-month period, w...

Building Language-Independent Concepts from Wikipedia

This paper describes a simple method for deriving language-independent concepts by identifying groups of pages about the same topic from Wikipedias in different languages. This allows the information about a concept obtained from different Wikipedias to be merged, and by this provides a way to determine which terms from different languages refer to the same concept. The method presented here wa...

Journal:
  • CoRR

Volume abs/cs/0604036

Publication year 2006